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by reference. 

Field of the Invention 

This invention pertains generally to wireless communication systems and methods 
and more particularly to a system and method for communicating using an improved 
estimate for multiple-input multiple-output wireless channel characteristics having 
compensation for phase noise and frequency offset. 
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BACKGROUND 

This invention relates to a method and apparatus for estimation of Multiple-Input 
Multiple-Output (MIMO) wireless channel characteristics. Training (or estimating) 
sequences are designed, transmitted and processed in such a way that the channel response 
is estimated for each transmitter, even if transmitters send training sequences 
simultaneously. The precision of the channel estimate is improved for channels with short 
impulse response by means of denoising. The negative effects of phase noise and 
frequency offset on channel estimation are minimized. The proposed training sequences 
are optimal or near-optimal from the viewpoint of mean-squared error in channel 
estimation for a given energy and duration of the training signal. Within above-described 
scope, several approaches to and designs of channel estimator are proposed. 

In a system that provides multiple radio-frequency (RF) channels existing within 
the same physical space, such as for example, a system that has two transmitting RF 
antennas transmitting to two receiving RF antennas, one would like to estimate the 
channel parameters for each spatial subchannel, i.e., for each transmitter-receiver pair. 
One example of a spatial subchannel parameter is the complex transmission coefficient. 
This subchannel parameter is approximately estimated for each transmitter and receiver 
pair. In particular, assume we are interested in a series of frequency subchannels, such 
as would be the case for a multi-carrier technique such as Orthogonal Frequency Division 
Multiplexing (OFDM). A complex transmission coefficient is estimated for each antenna 
pair and each frequency. 

With respect to FIG. 1 there is illustrated a diagrammatic illustration showing a 
simplified representation of the estimation of channel characteristics. Training sequences 
are transmitted sequentially for two antennas (antenna #A1, then antenna #A2). The 
transmitted training sequences are received by receiving antennas #A1' and #A2\ Thus, 
receiving antenna #AF and #A2' first receive the training sequence transmitted from 
transmitting antenna #A1, then from transmitting antenna #A2. All channel 
characteristics can then be estimated, utilizing known techniques. See for example 
chapter 2 in A. R. S. Bahai and B. R. Saltzberg, MultiCarrier Digital Communication, 
1 999 Kluwer Academic/Plenum Publishers, New York, ISBN 0-306-46296-6; and chapter 
5 in R. Van Nee and R. Prasad, OFDM for Wireless Multimedia Communications, Artech 
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House Publishers, Boston and London, ISBN 0-89006-530-6, incorporated herein by 
reference. Although this scenario provides a potentially operable approach, it has 
significant disadvantages in that it wastes time, is bandwidth inefficient, and wastes 
transmit power. 

Heretofore it has been known generally that the use of multiple transmit/receive 
antennas have a substantial benefit on the achievable data rate as compared to single 
transmit/receive antennas in multipath fading environments. It is also known that the 
error in channel estimation decreases with the increase in the energy transmitted during 



10 training, also referred to as the channel estimation stage. To improve bandwidth 

efficiency, it is desirable that the duration of training should be minimized. 

I Unfortunately, the ability to reduce the duration of training is in at least some ways 

•I s restricted by transmitter peak power limitations imposed by cost and/or technology 

Ili5 constraints. Potential coupling of channel outputs may also impose constraints on the 

extent to which the duration of estimation and training may be reduced. 



|d= There therefore remains a need for system and method that overcome these and 

^ other limitations. 

Ho 

SUMMARY 

The invention provides a method and apparatus for estimation of multiple-input 
multiple-output (MIMO) wireless channel characteristics. A method is provided for 
transmitting training sequence signals by a plurality of transmitting antennas such that 

25 the training subsequences are substantially orthogonal when received, through an arbitrary 

channel, at a plurality of receiving antennas. In one embodiment, the training sequence 
signals comprise subsequences which are mutually orthogonal at the receiver. The 
method provides an estimate of channel characteristics by receiving and processing said 
subsequences. The method thus provides an estimate of the channel characteristics for 

30 each transmitting antenna despite interference between transmitting antennas. 



One embodiment of the method provides for combining the estimates generated 
by processing the received subsequences to generate a refined channel estimate including 



A-70184/RMA/JML 

-4- 

an estimate of frequency offset and phase noise. Errors due to frequency offset and phase 
fluctuations are thus reduced. In one embodiment, interpolation and/or filtering of the 
channel estimate in frequency is provided to reduce estimation errors by exploiting 
redundancy in the frequency-domain representation of a channel with a short impulse 
response. 

The apparatus provides a plurality of transmitting antennas, a plurality of receiving 
antennas and a receiver configured to generate an estimate of the channel characteristics. 

In one embodiment, the receiver is configured to denoise the estimate of the 
channel characteristics. 

A method is provided for denoising an estimate of a wireless channel having a 
short impulse response. The method comprises estimating a frequency domain channel 
response, calculating a nontruncated time domain channel response comprising a first set 
of coefficients by performing a first transform-based procedure on the frequency domain 
channel response, truncating the nontruncated time-domain channel response by selecting 
certain of the first set of coefficients to generate a second set of coefficients that define 
a truncated time-domain channel response and calculating a denoised frequency-domain 
channel response by performing a second transform based-procedure on the truncated 
time-domain channel response. 

In one embodiment of the denoising method, the first transform-based procedures 
is an inverse Fourier transform and the second transform-based procedure is a Fourier 
transform. In another embodiment, the first transform-based procedure is an inverse Fast 
Fourier transform and the second is a Fast Fourier transform. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is an illustration showing an embodiment of channel estimation with time- 
domain transmission multiplexing. 

FIG. 2 is an illustration showing an embodiment of channel estimation with 
frequency-domain transmission multiplexing. 
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FIG. 3 is an illustration showing an embodiment of channel estimation with 
simultaneous transmission in temporal spatial and frequency-domains for a 2x2 system 
with 2 training blocks. 

FIG. 4 is an illustration showing an embodiment of channel estimation with 
simultaneous transmission in temporal, spatial and frequency-domains for a 2x2 system 
with 3 training blocks. The redundancy of training sequency is used to cancel relative 
frequency offset and phase noise between transmitter and receiver local oscillators. 

FIG. 5 is an illustration showing an embodiment of implementation of a 
communication system using a channel estimator with simultaneous transmission in 
temporary, spatial and frequency-domains for 2x2 system with 3 training blocks; where 
the redundancy of training sequence is used to cancel relative frequency offset and phase 
noise between transmitter and receiver local oscillators. 

DETAILED DESCRIPTION 

Exemplary embodiments are described with reference to specific configurations. 
Those skilled in the art will appreciate that various changes and modifications can be 
made while remaining within the scope of the claims. The discussion below is first 
directed toward general properties of a communication protocol and the manner in which 
one embodiment of the invention improves over conventional systems and methods, then 
toward demonstrating the general properties of optimal training sequences provided by 
certain embodiments of the invention, then discussing a single-input channel case for the 
sake of simplifying the discussion. Finally, attention turns toward the multiple-input 
multiple-output channel. 

In one embodiment of the invention, reduction and desirably minimization of 
channel estimation time (also referred to as training time) are accomplished within a 
transmitter peak transmit power limitation by sending training sequences simultaneously. 
Undesirably, such simultaneous transmission introduces the problem of later decoupling 
(or separating) channel outputs due to the different transmitters at the receiver, so that 
channel response can be calculated for each transmitter. This decoupling should desirably 
remain possible for wide and a priori unknown variations in the properties of wireless 
channels, and embodiments of the invention include several designs of training sequences 
with operable and in many instances optimal and near-optimal decoupling properties, so 
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that simultaneous transmission of training sequences by different transmitters is no longer 
problematic. 

One method to provide such channel estimation involves transmitting the training 
sequences simultaneously rather than sequentially, and assigning alternate channels to 
multiple blocks as illustrated in FIG. 2. For example, block "A" (even-numbered) 
channels to antenna #A1 , and block "B" (odd-numbered) channels to antenna #A2. In this 
case, antenna #A1' receives training sequences simultaneously from transmitting antennas 
#A1 and #A2, but the received training sequences can be separated by virtue of the fact 
they have different carrier frequencies. When channel parameter variation is slow relative 
to frequency, it is then possible to interpolate in the complex plane to obtain channel 
estimation for the frequencies not measured. See for example chapter 5 in R. Van Nee 
and R. Prasad, OFDM for Wireless Multimedia Communications, Artech House 
Publishers, Boston and London, ISBN 0-89006-530-6, incorporated herein by reference. 

An even better approach is to transmit the training sequence twice. In the first 
case, block A is transmitted by antenna #A1, and block B is transmitted by antenna #A2; 
while in the second case, block A is transmitted by antenna #A2, and block B is 
transmitted by antenna #A1 . The values of channel coefficients at frequencies which are 
absent in the training sequence for a given training block are interpolated based on the 
measured channel coefficients for the same training block, and then the interpolated 
values are compared with the measured values. Measured values are improved by taking 
into account interpolated values at the same frequency. In one embodiment, the post 
processing is or includes taking weighted averages of the measured and interpolated data. 

This approach provides a solution in the absence of significant delay spread in the 
channel but may not provide the desired precision of channel estimation for large delay 
spread, i.e., long impulse response of the channel. The desired precision may not be 
provided because of large interpolation errors at large delay spread in the channel, e.g., 
more than twenty per cent of the duration of the training block. 

For actual hardware implementations of transmitter and receiver local oscillators 
the phase offset between transmitter and receiver local oscillators typically depends on 
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time. This dependence manifests itself in zero-mean fluctuations (phase noise) 
superimposed on the average drift (frequency offset). Unless the typical variations in 
phase offset during training are small in comparison with the channel noise-to -signal 
ratio, phase noise and frequency offset deteriorate the quality of channel estimation. In 
the presence of frequency offsets or phase noise in the systems described above, the 
perfect orthogonality of the received signals is lost. 

An enhancement is therefore described that provide system and method for 
optimal or near-optimal channel estimation that preserves the orthogonality of received 
signals affected by frequency offset and/or phase noise. Training sequences, as well as 
techniques for identifying, constructing, and using these training sequences, are described 
that compensate for slow phase variations. Slow phase variations are variations in phase 
that have correlation time larger than eight inverse signal bandwidths. In at least one 
embodiment, slow phase variation is compensated by building the training sequence out 
of several successive subsequences, so that the phase offset remains approximately 
constant during each of the subsequences, and differences in phase offset for different 
subsequences are estimated and compensated to improve the quality of channel 
estimation. 

The mean-squared error in estimating a wireless channel with short impulse 
response is reduced by denoising techniques, in comparison with that for a wireless 
channel with longer impulse response, for the same duration and signal energy of 
training. A short impulse response wireless channel in this context would generally be 
an impulse response of duration less than one tenth of the duration of the block of the 
training sequence. By comparison, a long impulse response in this context would 
generally be an impulse response of duration more than one third of the duration of the 
block of the training sequence. This reduction in mean-squared error for a wireless 
channel with a short impulse response is possible because the number of degrees of 
freedom in the channel description increases with the length of the channel impulse 
response, so that shorter channel allows for larger training energy per degree of freedom, 
and therefore more precise estimate. 
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It will therefore be appreciated that in one aspect the invention improves over a 
conventional system and methods by providing a system and method for improving the 
bandwidth efficiency by reducing the duration of the channel training or estimation stage. 
In one embodiment, this improvement in bandwidth efficiency is achieved by estimating 
MEMO channel characteristics by transmitting training sequences simultaneously from a 
plurality of transmitting antennas. 

Attention is now directed toward some characteristics of communication channels 
over all frequency subchannels and a communication protocol utilizing data blocks, data 
frames, frame synchronizing sequence, cyclic prefixes, and training blocks; and to some 
general properties of an embodiment of training sequences in multiple-input multiple- 
output (MIMO) systems and methods over all frequency subchannels. These training 
sequences provide improvement over conventional sequences and are optimal or near- 
optimal. 

First, consider a vector wireless additive white Gaussian noise (AWGN) channel 
with transmitting and AV > A 7 ^ receiving antennas. Data is sent in blocks preceded 
by cyclic prefixes. The blocks are organized in frames. Each frame starts with a 
synchronizing sequence, followed by training blocks, and then data blocks. The channel 
is quasi-stationary, i.e., it has coherence time longer than eight times the duration of the 
frame transmission, or in somewhat less technical terms, changes little during 
transmission of each frame. The relationships between data blocks, data frames, frame 
synchronizing sequence, cyclic prefixes, and training blocks is illustrated in the figures 
(See for example FIG. 5.). 

If the length of the data block without prefix is L b!oclc) we need to know only the 
response of the channel at L block discrete tones or frequencies. This knowledge is achieved 
by splitting the training sequence into blocks of the same length as the number of discrete 
tones L block (plus cyclic prefixes of length L pref J. 

Method and procedure for optimizing or at least improving training sequences at 
both intrablock (micro) and interblock (macro) levels are now described. By intrablock 
or micro levels is meant within each training block of length L block . By interblock or 
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macro levels is meant by multiplying all the baseband components of a given training 
block by the same complex number of unit absolute value. 

A discussion of general properties of optimal training sequences in MIMO systems 
follows below. Properties discussed below are advantageously employed by the inventive 
apparatus and method, as discussed in later sections. 

For convenience of description, the remainder of the description is placed into 
several headings as follows: optimization of the spectrum of the training sequences, 
orthogonality properties of optimal training sequences in MIMO systems, chirp sequences 
as building blocks of training sequences, estimation of scalar channel in the presence of 
phase wander and frequency offset or phase noise denoising, estimation of a MIMO 
channel disregarding phase wander, estimation of a MIMO channel in the presence of 
phase wander, and exemplary communication system and architecture. These headings 
are provided for the convenience of the reader and do not otherwise restrict or limit the 
subject matter that is disclosed and it will be understood that aspects and elements of the 
various embodiments are described throughout the description. 

Optimization of the spectrum of the training sequences 

In this section, a single-input transmission is considered, the extension to multi- 
input transmission being apparent by extension and by further description hereinafter. 
Having accounted for cyclic prefix, we arrive at a periodic channel model in EQ. 1 : 

Y(w)=X(w)H(w)+N(w), (1) 

where w is one of L block equidistant discrete multi-tone (DMT) frequencies, X(w), Y(w), 
N(w), and H(w) are DMT components of input, output, noise, and channel transfer 

function, respectively. The estimator produces channel estimate H(w) . We use the 

mean-squared error (MSE) of the estimate: 



e ch =Z\n<.yv)-H(w) f 



(2) 
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as its figure of merit. It will be appreciated by those workers having ordinary skill in the 
art in light of the description provided herein that variations of this figure of merit and 
alternative figures of merit may be utilized. 

In this section, the transfer functions H(w) are assumed for purposes of ease of 
description to be mutually independent at different frequencies. Such independence may 
approximately hold for non-line-of-sight communication if the frequency difference is 
much larger than the delay spread. However, in at least one case of interest, when the 
block duration is much longer than the delay spread, the nearby frequency components 
of the channel response are strongly correlated, a dependence which is used to mitigate 
the effects of channel and phase noise as described in additional detail in later sections of 
this description. 

Without aprior knowledge of the channel and for mutually independent H(w), the 
minimum-mean-squared error (MMSE) channel is unbiased and given by the expression: 

with mean-squared-error (MSE) 8 ch given by the following expression: 

where N a is the average noise variance per complex dimension. Given the total signal 
power in the bloc, e ch is minimized by using X with flat spectrum fX(w) ^constant, such 
as XPRBS or chirp sequence. The second choice (use of a chirp sequence) is probably 
more convenient because any integer is possible of even L block . Note that if the 
estimating sequence has a non-flat spectrum, the error e ch is increased by the factor: 



E[/X(w)/ 2 JEfl//X(w)/ 2 J>L 



(5) 
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Orthogonality properties of optimal training sequences in MIMO systems 

So that the primary features of the inventive system, apparatus, and method are 
more easily understood, it is instructive to concentrate discussion on a flat channel case 
initially and then add generalization for the frequency-selective case as provided in later 
sections of this description. A discrete-time model is used for purposes of description for 
this flat channel case. 

The channel input is described by matrix X whose rows correspond to 
simultaneous inputs from different transmitters, while each column gives the input 
sequence in time for the corresponding transmitter. The channel output is described by 
matrix Y whose rows correspond to simultaneous outputs of different receivers, while 
each column gives the output sequence in time for the corresponding receiver. The input- 
output relationship is given by the expression: 



where TV is noise matrix, and H is the matrix of channel coefficients for each transmitter- 
receiver pair or Nj^ x Nj^ channel matrix. The elements of TV are assumed to be Gaussian 
independently identically distributed (i.i.d.) variables. 

Without a priori knowledge of the channel, the MMSE channel estimate is 
unbiased and given by the expression: 



where the matrix pseudo-inverse is defined by the first equality in Eq. (7). The MSE of 
this channel estimate is: 



Y=XH+N, 



(6) 



H=(X H Xy l X H Y=pinv(X)Y, 



(7) 



e ch ^ Tr[(H - H){H-H) H }= N a Tr[(X n 'X) l ]= N 7~> 



(8) 
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where N a is the average noise variance per complex dimension, and {Aj are the singular 
values of matrix X**X sorted in the decreasing order. The notation "Trf...J" refers to 
taking the sum of the diagonal elements of the matrix. Note also that X" is the complex 
conjugate also known as Hermitian adjoint of X and X is advantageously a chirp 
sequence, as described above. 

Given the total signal power in the block TrfXX 1 ], £ ch is minimized for equal 
singular values of X"X, for example, when X^Xis proportional to the unit matrix. By init 
matrix, it is meant a square matrix which has unit diagonal elements and zero off-diagonal 
elements. In other words, the channel inputs from different transmitters (i.e., the columns 
of X) should be mutually orthogonal and have equal average power during channel 
estimation, in order to minimize e ch . 

It is straightforward to generalize the foregoing discussion to non-flat channels 
whose (matrix) transfer functions depend on the frequency of each multitone, as in Eq. 
(1) are mutually independent at different frequencies, thereby permitting independent 
estimation of different H(w) (See also the above subsection describing Optimization of 
the spectrum of the training sequences). The input-output relationship is given by the 
expression: 

Y(w)=X(w)H(w}+N, (9) 

where N is noise matrix (assumed for purpose of ready description to be independent of 
frequency), and H(w) is the x channel matrix. The resulting channel estimate is: 

H(w)=(X H (w)X(w)y 1 X{wf Y=pinv(X(w))Y, (10) 

with MSE 

* k=\^Aw) 
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Given the total energy of the estimating sequence 



YTr[XX H ] 



(12) 



the error e ch is minimized, when the SVD values A k (w) are the same for different k and w. 



that is, for example, when )FX is proportional to the unit matrix. In other words, the 
frequency components of channel inputs from different transmitters (i.e., the columns of 
X(w) should be mutually orthogonal and have equal energy. If the training sequences do 
not satisfy this requirement, the estimation error 8 ch increases by the factor: 



where averaging occurs over both k and w indexes. 

For the optimal training sequences and AWGN channel, the estimation error is 
given by the ratio of the noise energy per degree of freedom N a to the signal energy per 
degree of freedom E„, that is E D /N 0 . This simple rule applies no matter if the transmitter 
has single or multiple antennas, and regardless of whether the channel is flat or frequency- 
selective. 

Some examples of optimal and near-optimal or improved training sequences for 
single-input channels that apply to MIMO channel estimation are now described. 

There are many types of optimal and near-optimal training sequences, for which 
Eq. (13) Aj/w) = const holds or at least approximately holds. However, only a few of 
such sequences are practical, at least in part because of one or more of transmitter peak 
power constraints, phase noise, and other restrictions or limitations. 



A k (w) = const, 



(13) 



E[A k (w)]E[V A k (w)]>l, 



(14) 
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Due to transmitter peak power constraints, it is hardly practical to use sequences 
with strongly nonuniform time distribution of signal power, such as short pulses, because 
the power amplifiers needed to output the required peak power are (at present) 
prohibitively expensive. In particular, the peak power is reduced if all the transmit (TX) 
antennas transmit simultaneously during the training sequence. 

Orthogonality properties of estimating sequences should not be significantly 
affected by slow changes in phase offset between receive (RX) and transmit (TX) local 
oscillators. Such changes occur or at least may occur due to carrier frequency offset 
between the transmitter and receiver local oscillators, as well as by low-frequency phase 
noise. Compensating for any clock frequency offset and/or clock jitter is not discussed 
here but are known in the art, see for example chapter 5 in A. R. S. Bahai and B. R. 
Saltzberg, MultiCarrier Digital Communications, 1999 Kluwer Academic/Plenum 
Publishers, New York, ISBN 0-306-46296-6, incorporated herein by reference. 

If the training sequences are sufficiently long, so that the TX-RX phase offset 
changes considerably during the training (even after any frequency offset compensation 
has been applied), then extra techniques should desirably be used to compensate for the 
remaining fluctuations in phase offset on channel estimation, which affects the design of 
the training sequences. An example of extra technique is adjusting the phases of different 
blocks of the training sequence to compensate for variable phase offset between TX and 
RX local oscillators. For example, a training sequence may advantageously be built of 
successive training subsequences, each of which subsequence suffices to calculate a 
channel response, albeit with less precision than the whole training sequence comprised 
of the plurality of successive subsequences. 

By comparing the partial channel estimates due to each subsequence, information 
about slow phase fluctuations is extracted and used to compensate for the effects of phase 
noise and frequency offset on channel estimation. 
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Chirp sequences as building blocks of training sequences 

Chirp sequences have a form given by the expression: 

W*]°cexp(*y* 2 /r^), (15) 



where integer T chirp is the period of the sequence, and index k runs through any T cMrp 
consecutive numbers, for example, k = 0, 1, (T cMrp - 1). The 7^-point Fourier 
transform of the time-domain chirp gives frequency domain chirp, for example, the 
discrete-frequency power spectral density (PSD) of the chirp is constant. These properties 
of uniform energy distribution in both time and frequency domain minimize transmitter 
peak power requirement and estimation error, respectively. 

For a single-antenna TX, the whole training sequence may be obtained by 
concatenating cyclically prefixed chirps. A cyclically prefixed chirp x chip [k] extended to 
negative k = -L pitGx , -L^ + 1, ... , -1, where is the length of the cyclic prefix, such 
as for example, x chiip [k] for k = -10, -9, ... , L block . For example, in orthogonal frequency- 
division multiplexing (OFDM) system with L block discrete multi-tones (DMTs), chirps 
with period T chjrps = L bhck may be used. The resulting training sequence estimates the 
channel at all DMT frequencies. 

Estimation of scalar c hannel in the presence of phase wander and frequency offset or 
phase noise denoising 

To illustrate how one embodiment of the inventive system, apparatus, and method 
may be used to improve estimation precision in the presence of phase noise, consider the 
following example. Let the training sequence consist ofN block cyclically prefixed chirps. 

Let H w (w) stand for a channel estimate derived from the mth block alone. Let the low- 
frequency phase noise be approximated by blockwise-constant process, that is, a process 
in which the TX-RX phase offset is constant during each training block. 

There are two possible sources of discrepancy between different estimates of 
channel H m (w). First, additive channel noise results in random deviations of partial 
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channel estimates from each other. Second, slow phase fluctuations (also referred to as 
phase wander), which occur on time scales much larger than the block time, result in 

random rotations of : 



H m {w)-> H m (w)e" f \ (16) 



rotations that are the same at different frequencies for a given i. 



To obtain an optimal or nearly optimal channel estimate, we combine h;(w) as 

follows. First, we estimate phase differences <p m -<p D , where m = 1,2, ... , N block -1 and n 
= N biook, as follows: 



exp[j(0-d))=-^ ; , (17) 



which reduces to: 

for small phase differences, <f> m -0„«l. 

To compensate for phase wander, partial channel estimates are combined as 



follows: 



H?X*~ leJi ~'~ + ' u °* ) H(w)+H N (w) 
H(w)=±^ — ^lL^ (19) 



and the channel is modeled as H(w) multiplied by a block-constant phase factor. 



Estimation of the block-constant phase factor depends on details of the phase noise 
model. In particular, if phase wander on the time scale of frame length is well 
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approximated by constant frequency offset, the phase factor is obtained by extrapolation 
of the best linear fit to <j> m ~^ Nblock • The precision of channel estimation may be further 
improved by decision-aided tuning of phase factor during data transmission. 

Estimation of a MIMO Channel 

The proposed system, apparatus, and method of efficient estimation of a MIMO 
channel is closely related with the estimation techniques for a single-input channel 
discussed above. However, it is desirable to take into account and cancel the interference 
among different transmitters during the estimation stage. 

In one embodiment, this cancellation is achieved by choosing training sequences 
for different transmitters (channel inputs) to be not only mutually orthogonal but also to 
produce mutually orthogonal channel outputs, regardless of a particular channel 
realization. For linear-time-invariant or quasi-time-invariant channels, both input and 
output orthogonality occurs for training sequences based on either pure frequency division 
multiplexing or hybrid time-frequency division multiplexing among different transmitters. 
(Pure time division multiplexing strongly increases requirements on the maximum output 
power.) 

Estimation of a MIMO channel disregarding phase wander 

One of the challenges in estimating a MIMO channel is to obtain separate channel 
responses for different transmitters, while maintaining simultaneous transmission during 
training (to satisfy peak power constraints). 

To this end, transmitters can be separated in the frequency domain as follows. 
Given the number of transmitters N TX , choose integer p > N m , such that 2p divides the 
number of multi-tones L block . During each training block, each transmitter uses a training 
sequence X p q \X\ with a different q, and channel response for each is estimated only at 
the corresponding "on-" frequencies (w p J where Xp q [k] is given by: 
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x P , q l k ] cc ex P *u 



pk 2 + 2qk 



(20) 



'block 



where p and q are integer, and 2p divides L block and w p q is given by: 




_2n_ 

L blocl 



[np+q), 



(21) 



where n is integer. 

Channel responses at the rest of the frequencies are obtained either through frequency- 
domain interpolation, or by using for each transmitter indexes q which depend on the 
training block, so that for each transmitter <? runs through all possible values O, ...,(p- 1). 

A related technique to separate channel responses to different transmitters is to 
properly phase training blocks for different transmitters and block positions, for example, 
as follows. Let there be N block > N m training blocks indexed with t =0, {N block - 1). 
During block t, w-th antenna sends sequence 



Let Y[t,k] be the received signal described by vector sequence. Then the channel output 
due to the chirp transmitted by a single antenna m is given by the m-th component of the 
Fourier transform of Y[t,k] over the first index (t\ from which the channel response for 
transmitter m is reconstructed by methods for a single-input channel, such as those 
described in above sections beginning "Estimation of scalar channel in the presence of 
phase wander and frequency offset or phase noise denoising". 



x m [t,k]=e-xp(2xjmt/N block )x chirp [k] . 



(22) 



For example, in one embodiment, the invention provides system, apparatus, and 
method for transmitting both (or multiple) signals A and B simultaneously (or 
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substantially simultaneously) and providing system, apparatus, and method for extracting 
individual antenna transmissions from the simultaneously transmitted (and subsequently 
received) signals. In the case where there are two transmitting antennas, N TX =2, one 
embodiment of the invention according to the method described above is to transmit two 
training sequences during two blocks. In this case N block =N TX . In the first training 
sequence, a signal with equal power spectral density over both channels is transmitted, 
as determined to be the optimal sequence structure in Eq. (22) for m = 0, 1, t = 0, 1, and 
N block = 2'. 

One embodiment of the inventive system, apparatus, and method use a simple "chirp" 
signal for the first signal that varies in time over the frequency band. In the second 
training sequence, the phase of the transmission is inverted over the second antenna 
(antenna #A2), as indicated in Eq. (22) x m [t,k] demonstrates this - antenna 2 (m=l) 
inverts only the second training block (t=l). An embodiment of this technique is 
illustrated in FIG. 3. 

Using this approach, the channel coefficients or parameters corresponding to 
antennas #A1 and #A2 can then be extracted from the received signals by taking the 
difference and sum of the consecutive training sequences, respectively. Let the received 
blocks be given by v„[t,k], where n = 0, 1 stands for the index of the receiving antenna, and 
/ and k are defined above. Then the elements H^ a (co) of the channel matrix //(g>) is given 
by: 

H 0 , n (a>)= (Y„[0,®]+ Y n [l,a])/2X 0 [0,a)] 
#i, M (<y)= ™<y]- Y n [l,a)])/2X 0 [0,CD] 

where XJt,G>] is the FFT of xjt,k] over the second index in the brackets, and Y n [t,G>] is 
the FFT ofj n [t,k] over the second index in the brackets. The solution can be generalized 
to higher numbers of antennas (such as for example 3, 4, 5, 6, or more antennas) by using 
more general phase factors according to the following equation: 
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-1 



exp(-2zjmt/N Uock ) 



For example, for three antennas and three training blocks, the system and method 
transmits three sequences, with the phases differing by (271/3). This would be reflected 
in Eq. (22) as N block = 3 case: 



A second embodiment of the invention provide additional enhancements that are 
more robust in the to the presence of frequency offsets or phase noise, where the afore 
described inventive and conventional approaches may loose the perfect orthogonality of 
the received signals in the presence of such frequency offsets or phase noise. This loss 
of perfect orthogonality is disadvantageous as it results in significant increase in the bit 
error rate to the extent that operation is sometimes impossible. 

Therefore in the second embodiment, the phase noise and/or frequency offset 
problem is reduced or eliminated by sending additional information to allow extraction 
of additional phase noise and frequency offset parameters. Referencing Eq. (22), this 
represents the case where N block > N TX . These additional parameters permit cancellation 
of phase noise and frequency offset so that given bit error rate is achievable at 
considerably lower transmitted power. 

The inventive system, apparatus, and method proceed in the two antenna case as 
if there were a third antenna: that is, instead of using two transmissions with a relative 
phase of -1 (inverted phase) on antenna #A2, the approach uses three training sequences 
in succession with relative phases of (1, a, a 2 ) as illustrated in FIG. 4. In one preferred 
embodiment, a = exp(27Uj/3) though other functions for "a" may alternatively be used so 
long as Eq. (22) or its equivalent is satisfied. One then proceeds as described above and 
in even greater detail hereinafter, by extracting a "third antenna channel" as if a third 
antenna was transmitting. This third antenna channel will be identically 0 (zero) in the 
absence of additive channel noise, phase noise, and frequency offset. Local oscillator 
(LO) frequency offset, when present, will show up as an extra signal with magnitude 
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varying linearly with time, a signal which contains "third-channel" components; and both 
phase noise and additive channel noise will show up as a stochastic "third-channel" 
signal. Thus from the degrees of freedom present in the received signal that are unused 
by the transmitter, the inventive system and method permit extraction of information 
about phase noise and frequency offsets, and correctly estimate the channel. 

Estimation of a MIMO channel in the presence of phase wander 

Analogous to the case of a single-input channel, phase wander results in relative 
phase rotations of channels subestimates obtained during different time periods of 
estimating stage. To improve precision of channel estimation, such phase rotations 
should desirably be estimated for and compensated. Moreover, the training sequence 
itself should desirably be designed so that to have modularity in time, for example, the 
training sequence estimate should desirably be composed of several subsequences, each 
of which is located within a certain period of estimating stage and is sufficient to calculate 
a channel response, although with less precision than the whole or entire training 
sequence. 

Depending on the relationship between the time scale of phase wander r ph , block 

time T block = L b!ock T sym , training time T training = N block T bloclo and the number of transmitters 
N m , different schemes of phase compensation apply. 

For very slow phase wander f ph »T trajning , no phase compensation is necessary. 
For somewhat faster phase wander, when 

^ph ~ ^training 

butr ph »N TX T block , (23) 

the training subsequences can be identical composed of training blocks each, and 
given by Eq. (22), where N block is substituted by N TX , and index t stands for the position in 
the sw&sequence. The resulting training subsequences are combined according to the 
techniques described in above sections, including "Estimation of scalar channel in the 
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presence of phase wander and frequency offset or phase noise denoising", and modified 
to allow for matrix form of the transfer function. In particular, Eqs. ( 1 7) and (18) become 



and 



respectively. 



~ M '--'-*T..iHH m ww (24) 



Xjm(rr[/f.(^f(w)3) 

9 m -fn = ^ ~ — - H ~ , (25) 



Exemplary Communication System and Architecture 

One embodiment of a communication system, and component transmit and 
receive portions, according to the invention is illustrated in FIG. 5. In the illustrated 
embodiment, the wireless communication system includes a 2-antenna transmitter (TX). 
Each of the two transmitter branches of which includes the following elements: an 
in-phase/quadrature-phase (I/Q) modulator, an intermediate frequency (IF) amplifier, a 
bandpass filter, a mixer, a radio frequency (RF) preamplifier, a bandpass filter, a power 
amplifier, and a transmitting antenna. In this particlar embodiment, the mixers in both 
branches of TX are driven by the same local oscillator with 1/f 2 spectrum of the phase 
noise. In one embodiment, the single-sideband (SSB) spectral density of the phase noise 
is 108 dBc/Hz at 100 KHz frequency offset. This magnitude of phase noise is for 
example, a typical number for monolithic implementations of a microstrip oscillator. The 
separation between the TX antennas is assumed for purposes of this example to be two 
wavelengths at the carrier frequency. 



The system also includes a 2-antenna receiver (RX) ), each of the two branches 
of which includes the following elements (left to right): a receiving antenna, a pre-filter, 
a low-noise amplifier (LNA), a bandpass filter, a second-stage amplifier, a mixer, a 
bandpass filter, a third-stage amplifier, and an I/Q demodulator. In this particular 
embodiment, the mixers in both branches of RX are driven by the same local oscillator 
with single-sideband (SSB) phase noise of 108 dBc/Hz at 100 KHz frequency offset. 
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Again, for purposes of this example the separation between the antennas is assumed to be 
two wavelengths at the carrier frequency. 

In this embodiment, 1024 discrete multitones (DMT) are used for data 
transmission. Other numbers of DMT may be used in other embodiments. The 
non-line-of sight wireless channel described by Rayleigh fading with all the elements of 
channel matrix independent identically distributed variables with zero mean and unit 
standard deviation. The length of the channel response in the time domain equal 32 
symbol periods, and the length of the cyclic prefix (CP) is 64 symbol periods. 

In this exemplary embodiment, channel estimation lasts for three training blocks. 
For the upper TX branch, each training block is a (cyclically prefixed) chirp sequence. For 
the lower TX branch, each training block is a (cyclically prefixed) chirp sequence 
multiplied by the phase factor shown in the lower left portion of FIG. 5. 

For this particular embodiment, the average signal-to-noise ratio (SNR) at the 
receiver is about SNR=32dB. The signal bandwidth is 3MHz, so that the corresponding 
symbol period in this exemplary embodiment is 0.333 microseconds. The following four 
situations were simulated as validation of aspects of the inventive system and method 
each for 10000 channel realizations. 

In a first situation, a conventional channel estimator (such as a channel estimator 
described in Chapter 5 in A. R. S. Bahai and B. R. Saltzberg, MultiCarrier Digital 
Communications, 1999 Kluwer Academic/Plenum Publishers, New York, ISBN 
0-306-46296-6) is assumed where there is a zero frequency offset between the transmit 
local oscillator (TX LO) and the receiver local oscillator (RX LO). The figure of merit for 
the channel estimator (in terms of the rot mean square (rms) error of the channel 
estimation) for this first situation is a relative rms error = 0.0557. 

In a second situation, a conventional channel estimator (such as a channel 
estimator described in Chapter 5 in A. R. S. Bahai and B. R. Saltzberg, MultiCarrier 
Digital Communications, 1999 Kluwer Academic/Plenum Publishers, New York, ISBN 
0-306-46296-6, incorporated herein by reference) is assumed where there is a frequency 



A-70184/RMA/JML * " ' 

-24- 

ofFset between TX LO and RX LO is assumed to have a Gaussian distribution with zero 
mean and standard deviation of 300Hz. The figure of merit for the channel estimator (in 
terms of the rot mean square (rms) error of the channel estimation) for this second 
situation is a relative rms error = 0. 106. 

In a third situation, an embodiment of the inventive channel estimator described 
herein is assumed along with a zero frequency offset between TX LO and RX LO. The 
figure of merit for the channel estimator (in terms of the rot mean square (rms) error of 
the channel estimation) for this third situation is a relative rms error = 0.017. 

Finally, in a fourth situation, an embodiment of the inventive channel estimator 
described herein is assumed along with a frequency offset between TX LO and RX LO 
having a Gaussian distribution with zero mean and standard deviation of 300Hz. The 
figure of merit for the channel estimator (in terms of the rot mean square (rms) error of 
the channel estimation) for this fourth situation is a relative rms error = 0.019. 

The comparison of the numbers for the conventional (situations 1 and 2) and 
inventive (situations 3 and 4) channel estimator show that the inventive system and 
method provide significantly better precision, especially in the presence of frequency 
offset (for example situation 2 versus situation 4). 

It will be appreciated that the afore described method may be utilized in 
conjunction with a wireless communication system having a plurality of transmitters and 
a plurality of receivers. It will also be appreciated that aspects of the inventive 
methodology may be separately incorporated into one or more transmitter elements and/or 
one or more receiver elements. Furthermore, the methods described herein may be 
implemented in hardware, software, firmware, or any combination of these so that the 
invention may for example include a specialized analog or digital signal processor or 
processing unit. The invention also provides a computer software program and a 
computer software program product to the extend that one or more procedures described 
herein are implemented as computer software instructions for execution in either a general 
purpose computer or in specialized computer or processing hardware or system. 



